Nucleic acids encoding recombinant soluble endoproteases

ABSTRACT

The present invention relates to the expression and secretion in  Saccharomyces cerevisiae  of readily purifiable soluble variants of the Kex1 endopeptidase of  Kluyveromyces lactis  and the purification and use thereof for the in vitro processing of recombinant proteins usable in industrial applications. The soluble Kex1 endoproteases described here are free from the transmembrane domain of the native enzyme; the deletion of the transmembrane domain is achieved by removing at least 57 amino acid residues from the C-terminal.

CROSS REFERENCE TO PRIOR APPLICATIONS

This application is a divisional of U.S. patent application Ser. No. 11/804,557, filed on May 18, 2007, which is a divisional of U.S. patent application Ser. No. 10/181,277 filed Jul. 11, 2002, now U.S. Pat. No. 7,241,609, which application is a U.S. national phase of International Application No. PCT/EP01/13257, filed Nov. 14, 2001, which application claims priority from Italian Application No. MI2000A002465, filed Nov. 16, 2000. The disclosure of all applications are incorporated herein by reference in their entirety.

The present invention relates to novel endopeptidases having a high cutting specificity to be used in the in vitro processing of recombinant proteins usable in industrial applications and, more particularly, soluble secreted derivatives of the Kex1 protein of Kluyveromyces lactis and the use thereof, in a soluble form or in a form immobilized on an insoluble organic or inorganic support, for the release of the protein of interest from fusion proteins.

FIELD OF THE INVENTION

The manufacture of heterologous recombinant proteins in suitable expression systems is one of the main applications of industrial interest of recombinant DNA technology. Owing to the instability of the peptide/protein in the host cell, it is often advantageous to manufacture the peptide/protein of interest in the form of a fusion protein comprising a protective (or stabilising) protein which will subsequently be processed at a specific predetermined site in order to free the desired protein or peptide.

That process also enables the desired protein or peptide to be obtained without the extension of an N-terminal methionine residue which constitutes the first amino acid residue of proteins expressed in bacterial systems. Fusion proteins are also manufactured with the aim of increasing expression levels or of facilitating the purification process by the selection of suitable polypeptide sequences, to the amino- or carboxy-terminal ends of which the protein of interest is attached.

The success of that strategy requires the availability of chemical or enzyme reagents that can effect the processing of the fusion protein with the desired level of specificity, ideally without releasing any secondary products, so as to reduce the cost of downstream processing. Of the chemical methods proposed, mention may be made of cutting at methionine residues with CNBr, acid hydrolysis at the Asn-Pro dipeptide or cutting with hydroxylamine at the Asn-Gly dipeptide (Fontana et al.; Practical protein chemistry. A handbook. pp. 5569-575, 1986). The enzyme methods make use, for example, of lysil-endopeptidases (Achromobacter protease I) which specifically cut the peptide bond from the carboxyl end of the lysine and protease V8 from staphylococcus which specifically cuts the peptide bond from the carboxyl end of glutamic acid (Japanese Examined Patent Publication No. 6-87788). However, because those chemical methods and the endoproteases recognise a single amino acid residue, it is necessary for the amino acid residue not to be present in the desired peptide in order to permit the efficient excision of the desired peptide from the chimeric protein; thus, the peptides that can be manufactured are limited. Various studies have therefore been directed to the search for and/or engineering of proteases having substrate specificity suitable for their use in the specific proteolytic cutting in vitro of recombinant proteins of industrial interest. Other endoproteases having a cutting sequence which is more restricted, that is to say, which covers a sequence of amino acids and not a single residue, have been used and include, for example, thrombin, factor Xa and enterokinase (Nilsson et al., Current Opinion Struct. Biol. 2, 569-575, 1992).

More recently, considerable interest has been concentrated on some members of the family of the subtilases, or serine proteases, the more well-known members of which are: (1) bacterial subtilysines, (2) the Kex2 protease of S. cerevisiae, (3) furine and furine-analogue proteins. In particular, while bacterial subtilysines do not exhibit sufficient cutting specificity for the in vitro processing of fusion proteins, except in suitably engineered variants (Ballinger et al., Biochemistry 35, 13579-13585, 1996; U.S. Pat. No. 5,837,516), both the Kex2 protein and furines exhibit a cutting specificity adequate for the purpose.

Subtilases having a high cutting specificity, which will be referred to hereinafter by the term kexino-analogue proteases, play the molecular/physiological role of “pro-hormone convertases”; that is to say, they are enzymes that produce peptide hormones from their precursors in vivo, by proteolytic cutting at the C-terminal end of pairs of base residues, Lys-Arg or Arg-Arg. FIG. 1 shows diagrammatically the structure of kexino-analogue proteases. The subtilysine-analogue catalytic domain which extends for approximately 330 amino acids is highly conserved between the eukaryotic pro-protein convertases. In particular, the residues of the active site which constitute the catalytic triad (Ser-His-Asp) and a residue of Asn which stabilises the oxyanionic cavity in the transition state, are present in corresponding positions in all members, except in PC2, where the Asn residue is substituted by an Asp residue (Bryan et al., Proc. Natl. Acad. Sci. U.S.A., 83: 3743-37451986).

The sequences flanking those residues are also well conserved. In addition, the region of 140 amino acids which follows the catalytic domain, known as “Homo B”, “P” or “middle” domain, is also well conserved between the eukaryotic convertases, including the yeast protease Kex2, but is absent from bacterial subtilysines. The Homo B domain is essential for catalytic activity (Zhong et al., FEBS Lett., 396: 31-36. 1996). That domain contains a conserved Arg-Gly-Asp sequence which resembles the recognition sequence of integrins. The mutation of one of those three residues in PC1/PC3 causes the loss of catalytic activity and the incorrect directing of that neuroendocrine convertase towards the constituent secretive pathway (Lusson et al., Biochem. J., 326: 737-744 1997). Another conserved region is the propeptide, which is removed autocatalytically by processing the Arg-Xaa-Lys-Arg site during the maturing of the convertases. Towards the C-terminal end, furine, PACE4, PC5/PC6A and B have a domain rich in Cys, which is well conserved and the role of which has hitherto been unknown. Furine, Kex2, PC5/PC6B and LPC/PC7/PC8/SPC7 also have a transmembrane domain near the C-terminal end.

The Kex2 endoprotease of the yeast Saccharomyces cerevisiae was the first enzyme demonstrated, to which genetic and biochemical proofs have permitted the ascribing of the function of processing pro-proteins at di-base sites in the trans-Golgi network.

The KEX2 gene codes for a glycoprotein of 814 amino acid residues with a molecular mass of from 100 to 120 kDa. That glycoprotein is anchored to the membranes of the trans-Golgi network and has the physiological role of processing the α-factor and the killer toxin. The Kex2 protein is a calcium-dependent protease serine which specifically cuts peptide bonds at the C-terminal end of Lys-Arg, Arg-Arg and Pro-Arg sequences (Mizuno et al., Biochem. Biophys. Res. Comm. 144: 807-814 1987). The amino acid sequence of Kex2 contains, in the NH₂-terminal portion, a pre-pro-domain having potential autoproteolytic sites (Lys₇₉-Arg₈₀, Pro₁₀₂-Arg₁₀₃ and Lys₁₀₈-Arg₁₀₉) followed by a region (144-438 aa) having a high degree of sequence homology (identity of 30%) with bacterial subtilases (Mizuno et al., Biochem. Biophys. Res. Commun., 156: 246-254 1988 ; Fuller et al., Proc. Natl. Acad. Sci. USA, 86: 1434-1438 1989). The Kex2 protease is expressed in the MATα c cells of the yeast S. cerevisiae as an inactive precursor and the NH₂-terminal region of the mature form is generated by proteolytic processing at the Lys₁₀₈-Arg₁₀₉ site, followed by proteolytic cutting of the dipeptides Leu-Pro and Val-Pro by the dipeptidyl-aminopeptidase enzyme Ste13 (Brenner and Fuller, Proc. Natl. Acad. Sci. USA, 89: 922-926 1992).

The catalytically active Kex2 protease of S. cerevisiae was purified from yeast as a secreted soluble enzyme (ss-Kex2) generated by the insertion of a stop codon before the COOH-terminal transmembrane domain. This modification facilitated the purification of the enzyme. Studies on the substrate specificity of the purified ss-Kex2 S. cerevisiae protease have demonstrated that the protease does not exclude any residue in the P3 position except for Asp and Pro, but is, on the other hand, selective for amino acids in the P2 and P1 position. In particular, Kex2 in the P2 position would exclude amino acids having bulky chains, but would, however, recognise those having positively charged residues while, in the P1 position, the Kex2 endoprotease is extremely selective both for the charge and for the structure of arginine. EP-327377 describes Kex2 proteases of S. cerevisiae that are free from the C-terminal hydrophobic region; those proteases are soluble in water and can be secreted outside the cell, leaving the enzyme activity of the native protein unaltered.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the structural organisation in domains of the Kex2 protein, a typical representative and the first identified member of the family of the kexin-analogue processing subtilases.

FIG. 2 shows the alignment of the protein sequences of Kex2 from S. cerevisiae (SEQ ID NO: 4) and Kex1 from K. lactis (SEQ ID NO: 3).

FIG. 3 shows the differences in thermal stability of the proteases ss-Kex2 from S. cerevisiae and ss-Kex1 from K. lactis.

FIG. 4 shows the activity profile, including the optimum functioning temperature of the proteases ss-Kex2 from S. cerevisiae and ss-Kex1 from K. lactis.

FIG. 5 shows the elution profile in RP-HPLC of a purified preparation of a soluble ss-Kex1 protease of the present invention.

FIG. 6 shows the in vitro digestion kinetics of a fusion protein showing a reaction yield higher than 95% obtained with a soluble ss-Kex1 protease of the present invention.

FIG. 7 shows the in vitro digestion profile of a fusion protein obtained with an immobilised ss-Kex1 protease of the present invention that can be re-used in the freeing of the protein of interest from the fusion partner.

DETAILED DESCRIPTION OF THE INVENTION

Some strains of Kluyveromyces lactis exhibit a killer phenotype secreting a toxin capable of killing sensitive cells of various species of yeast. That killer property is due to the presence of two plasmids of linear DNA, pGKL1 and pGKL2, which cooperate to produce the toxin and a component which permits the immunity thereof. A mutation (kex1) was isolated at a single locus of the K. lactis chromosome, called KEX1, which mutation leads to the loss of the killer phenotype, while enabling the above-mentioned plasmids to be maintained.

Cloned by genetic complementation of the above-mentioned mutation, the gene KEX1 was found to be orthologous to the gene KEX2 of Saccharomyces cerevisiae. The proteins encoded by the genes KEX2 of S. cerevisiae and KEX1 of K. lactis have a high degree of sequence homology (identity >50%) and each of them is capable of compensating for the deficiency of the other. That information suggests that the KEX1 gene of Kluyveromyces lactis (Wesolowski-Louvel et al. Yeast 4: 71-81 1988) codes also for a protease that has cutting specificity for di-base residues and that is involved in the production of the mature form of the killer toxin generated by processing its precursor. However, hitherto no data have been available relating to the expression of the product of the KEX1 gene, nor have any data been available concerning the biochemical and enzymatic characterisation of that protein.

Recombinant Kex1 proteases of K. Lactis, constituting the principal object of the present invention, have now been found that are free from the transmembrane domain (and will be referred to by the term ss-Kex1) and that manifest in vitro endopeptidase catalytic activity in respect of substrates containing Pro-Arg residues and di-base residues, in particular, towards Arg-Arg, Lys-Arg residues.

It has also surprisingly been found that those recombinant Kex1 proteases, the protein sequence of which, except for deletions comprising the domain for anchoring to the membrane (possibly except for deletions comprising the domain for anchoring to the membrane) corresponds to that encoded by the KEX1 gene of K. Lactis, demonstrate substantially greater thermal stability compared with proteins of the same family, the protein sequence of which corresponds to that encoded by the KEX2 gene of S. cerevisiae, so that they can be used in processes of targeted proteolysis in vitro under conditions in which other proteins of the kexine family, in particular Kex2 from S. cerevisiae, are inactivated.

The soluble Kex1 proteases according to the present invention are characterised by a greater stability compared with the corresponding variants encoded by the KEX2 gene of S. cerevisiae, the soluble Kex1 proteases preserving more than 50% of their activity under conditions in which the Kex2 protein is completely inactivated.

As will be appreciated from the following, in the soluble Kex1 endoproteases according to the present invention, the deletion of the transmembrane domain is achieved by removing at least 57 amino acid residues from the C-terminal of the enzyme encoded by the KEX1 gene of K. lactis; in the preferred embodiment of the invention, at least 57 and not more than 100 amino acid residues are removed from the C-terminal; the gene sequence used experimentally is that available from the EMBL data bank under the accession number X07038.

In particular, the soluble Kex1 endoprotease according to the preferred embodiment of the present invention is characterized by having sequence SEQ ID NO 2 or, in any case, by having a sequence with a 90% homology, preferably a 95% homology, with respect to SEQ ID NO 2.

A second object of the present invention is represented by a DNA molecule coding for a soluble Kex1 endoprotease according to the present invention and, in particular, coding for a soluble Kex1 endoprotease in which at least 57 amino acid residues have been removed from the C-terminal and, preferably, coding for a soluble Kex1 endoprotease in which at least 57 and not more than 100 amino acid residues have been removed from the C-terminal.

According to a preferred embodiment said DNA molecule is characterized by (a) having sequence SEQ ID NO 1, (b) having a sequence which hybridize to SEQ ID NO 1, (c) having a sequence which is degenerate as a result of the genetic code to the DNA having sequence SEQ ID NO 1 and/or (d) having a sequence with a 90% homology, preferably a 95% homology, with respect to SEQ ID NO 1.

A further object of the invention is constituted by a process for the manufacture of a biologically active substance of interest (a peptide or a protein), comprising:

-   -   (a) the production of a fusion polypeptide or protein comprising         the sequence of the peptide or protein of interest in a         construct of the type NH₂-A-B-C-COOH, where NH₂ is the         NH₂-terminal end of the fusion polypeptide or protein, A is any         desired protein or polypeptide (optionally only the methionine         encoded by the start codon of the translation), B is a joining         “linker” fragment terminating with an amino acid sequence         recognised by the Kex1 endoprotease of the present invention, C         represents the polypeptide or protein of interest, so that the         first amino acid of the mature protein of interest is         immediately at the carboxyl end of the sequence recognised by         the Kex1 endopeptidase according to the present invention and         COOH represents the carboxy-terminal end of the fusion         polypeptide or protein;     -   (b) the in vitro incubation of the fusion polypeptide or protein         mentioned in step (a) above in the presence of a Kex1         endoprotease according to the present invention, in order to         separate the protein or polypeptide of interest from the fusion         partner and with the free NH₂-terminal end corresponding to the         NH₂ terminal end of the polypeptide or protein of interest;     -   (c) the separation and purification of the biologically active         peptide or protein of interest.

The procedures for carrying out operations (a), (b) and (c) listed above are well known in the art and therefore do not have to be explained in detail; they are described, for example, in documents EP-327377, EP-794254, EP-794255 and U.S. Pat. No. 5,077,204, the content of which is to be regarded as an integral part of the present description.

The endoproteases according to the present invention can be used to process fusion proteins containing peptides or proteins of industrial and/or therapeutic interest; possible proteins obtainable using the process according to the present invention are, for example, recombinant proteins for therapeutic use, such as, for example, peptide hormones, interleukins, interferons, cytokines, growth factors, fibrinolytic enzymes and recombinant enzymes of industrial interest which can be used as biocatalysts, such as, for example, lipases, hydrolases, nucleases, oxidases, phosphatases.

Other objects of the invention are represented by the DNA sequences which code for the above-mentioned proteases, by the corresponding expression plasmids and by the host cells containing them.

The Kex1 protease of Kluyveromyces lactis according to the present invention was obtained by causing variants of the Kex1 protease that are free from the domain for anchoring to the membrane to be expressed in the Saccharomyces cerevisiae yeast. The secretion in the growth medium of the ss-Kex1 proteases was achieved either by means of the homologous pre-pro region or by means of the pre-pro region of the homologous Kex2 protein of S. cerevisiae, as well as thanks to the (pre) signal sequence of the glucoamylase of Saccharomyces cerevisiae var. diastaticus. Owing to those studies, it was surprisingly found that the catalytic domains of Kex1 and Kex2 have functional differences as demonstrated by the observation that the Kex1 protein terminating at residue 580 is catalytically inactive, unlike the Kex2 protein terminating at residue 593, which is the corresponding residue in the Kex2 protein (Gluschankof et al., EMBO J., 13: 2280-2288 1994). Growth conditions more suitable for the production and secretion of the protease in a flask were also sought.

Soluble Kex1's were purified from the growth medium and some of their biochemical properties were determined, including the cutting efficiency on model substrates and the stability and catalytic activity in the presence of various physico-chemical agents, such as the temperature.

The ss-Kex1 proteases so obtained can be used as such or can be immobilised on an insoluble organic or inorganic matrix; the proteases so immobilised can be reused several times in the in vitro processing of fusion proteins of industrial interest, with evident advantages.

The thermal stability of the novel soluble secreted ss-Kex1 endopeptidases is such that they maintain approximately 60% or more of their original activity after incubation at 50° C. for 6 minutes, under which condition the corresponding Kex2 proteins lose their activity completely.

The estimated molecular mass of the novel ss-Kex1 endopeptidases of the present invention is approximately from 65 to 70 kDa (enzyme produced from the strain NP31) and from 70 to 80 kDa (enzyme produced from the strain NP168) as determined in SDS-PAGE after heating and reduction of the sample.

The invention will be explained hereinafter by specific examples. The examples have the sole purpose of clarifying the invention, although it should be appreciated that the invention is not in any way limited thereby.

EXAMPLES Example 1

Construction of Plasmids for the Expression of Kex1 from Kluyveromyces Lactis in in Saccharomyces Cerevisiae

Unless otherwise indicated, conventional procedures were used for all the standard manipulations of the recombinant DNA. A collection of those procedures is given, for example, in Sambrook et al., (1989) Molecular Cloning.

In order to construct plasmids for the expression in S. cerevisiae of truncated variants of the COOH-terminal region of the Kex1 protease of K. lactis, under the control of various signal sequences, PCR's were carried out with suitable mutagenic oligonucleotides, standard subcloning procedures being followed. The expression vectors used in this invention were pEMBLyex4 (Baldari et al, EMBO J., 6: 229-234 1987), pVTU (Vernet et al., Gene 52: 225-233 1987) and YEPSTA (Martegani et al., Appl. Microbiol. Biotechnol., 37: 604-608 1992). The plasmids expressing kexines from those derivatives are listed in Table 1. On the basis of the information given in that Table, it will be possible for experts in the art to construct the above-mentioned plasmids without excessive experimentation. In this patent, reference is made to the following sequences of proteins and nucleic acids: (KEX1 gene, EMBL accession number X0738; Kex1 protein, Swiss-Prot accession number 09231; KEX2 gene EMBL accession number M22870; Kex2 protein, Swiss-Prot accession number P13134). In the following text, nucleotide 1 refers to the adenine nucleotide (A) corresponding to the translation start codon (ATG) of each given gene coding for proteases; similarly, amino acid 1 refers to the methionine encoded by said start codon. Analogously to the description given above for Kex2, the soluble and enzymatically active mature forms of Kex1 described in the present invention are initially expressed in the form of inactive pre-pro-proteins which are subsequently processed at the Lys101-Arg102 site (when the homologous pro-protein is used) or at the Lys-113-Arg-114 site (when the kex2 pro-protein is used). For the purposes of the present invention, the soluble and enzymatically active mature forms of Kex1 are characterised by having a polypeptide chain between amino acid 103 and amino acid 643 of the sequence given in FIG. 2 and are indicated by ss-Kex1-Cx terminology, where x represents the amino acid residue of the sequence of FIG. 2 which constitutes the carboxy-terminal residue of the soluble forms of Kex1. Purely by way of example, the construction of the PL24 plasmid which expresses the Kex1-C600 protease is given in detail. Since the DNA sequences are known, the construction of those plasmids is within the ordinary competence of an expert in the art and can also be carried out by means of strategies and techniques which differ from those exemplified but which are equivalent in terms of obtaining the expected product(s).

In order to construct the plasmid for the expression of the Kex1-C600 truncated at amino acid 600, which residue corresponds to amino acid 613 in the Kex2 protein encoded by the KEX2 gene of S. cerevisiae, the sequence encoding residues 1-600 of Kex1 was amplified by means of PCR with the oligonucleotides indicated below (the nucleotides constituting the restriction sites for BamHI and HindIII are underlined):

SEQ ID NO:5 5′CGC GGA TCC ATG ATC C TA TCG TCG CAG C3′

SEQ ID NO:6 5′C CCC AAG CTT TCA TTC AGC ATC CTC TTT GTC3′

At the end of the PCR, the amplified fragment was subjected to restriction with the endonucleases BamHI and HindIII, purified from agarose gel after preparative electrophoresis and subcloned in the expression vector pEMBLyex4 which had been linearised beforehand with BamHI and HindIII, thus allowing plasmid pL24 to be obtained which enables the Kex1-C600 protein to be expressed in yeast under the control of the inducible hybrid promoter GAL1-10-CYC1.

TABLE 1 Plasmids used in this invention expressing various forms of kex1 Parent Pre Pro Plasmid Marker(s) plasmid Promoter sequence sequence ss-Kex1-Cx PL24 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C600. PL57 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C579 PL58 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C580 PL98 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C611 PL99 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C622 PL86 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C640 PL87 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C643 PL120 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex2₁₋₂₇ Kex2₂₈₋₁₁₄ Kex1-C600 PL68 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex2₁₋₂₇ Kex1₂₅₋₁₀₂ Kex1-C600 PL22 URA3, leu2d YEpSTA GAL10-CYC1 Sta2₁₋₁₂ Kex1₂₅₋₁₀₂ Kex1-C600 PL132 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex2₁₋₂₇ Kex2₂₈₋₁₁₄ Kex1-C611 PL133 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex2₁₋₂₇ Kex2₂₈₋₁₁₄ Kex1-C640 PL134 URA3, leu2d pEMBLye4 GAL10-CYC1 Kex2₁₋₂₇ Kex2₂₈₋₁₁₄ Kex1-C643 PL135 URA3 PVTU TDH3 Kex1₁₋₂₄ Kex1₂₅₋₁₀₂ Kex1-C600 PL144 URA3 pVTU TDH3 Kex2₁₋₂₇ Kex2₂₈₋₁₁₄ Kex1-C600

Example 2 Expression and Secretion of the Kex1 Protein of Kluyveromyces Lactis in Saccharomyces Cerevisiae

Various strains of S. Cerevisiae were transformed using the constructs described above in order to obtain the expression of catalytically active and soluble secreted forms of the Kex1protease of K. Lactis. GRF18* (MATα his 3-11, 15 leu 2-3, 112 ura3); X4004 (MATa lys5 met2 ura3 trp1); MVY4935 (MATα sta10 leu2 ura3 arg4 his3 lys2); W303 1-a (MATa leu2-3,112 ura3-1 trp1-1 his3-11,15 ade2-1 can1-100 GAL SUC2). The method of transformation used is that described in Schiestl and Gietz (1989) Curr. Genet. 16: 339-246. The flasks used for growing the yeast in liquid medium were incubated in a Dubnoff thermostated bath and agitated continuously. Growth on a plate was carried out in suitable incubators in a humid atmosphere. For the methodology in respect of the yeast, wherever the methods are not explicitly described, reference should be made to C. Guthrie and G. R. Fink, Methods in Enzymology 194.

The transformed cells were plated onto selective minimum medium (YNB), with glucose as the source of carbon, and were incubated at 30° C. Under those conditions, the engineered cells grow without producing the protein of interest. The expression system based on the pEMBLyex4-analogue vectors can be induced by galactose, because the protein of interest is placed under the control of the inducible hybrid promoter GAL-CYC, and the presence of glucose in the culture medium brings about a state of repression of the expression. That system provides for the use of raffinose as the source of carbon for permitting the derepression and the use of galactose in order to ensure the induction state. In addition, the expression system based on pEMBLyex4-analogue vectors provided with the selective marker leu2d provides the possibility of further amplifying the number of copies of the plasmid of interest, having recourse, when permitted by the genotype of the transformed strain, to selection in minimum medium free from leucine.

The transformed strains were grown in the following culture media:

YP-rich medium: Source of carbon 20 g/l; Peptone 20 g/l; Yeast extract 10 g/l.

YNB-aa minimum medium: Source of carbon 20 g/l; YNB without amino acids 6.7 g/l.

Standard 1040 medium: YNB without amino acids and without ammonium sulphate 1.7 g/l; (NH4)₂SO₄ 1.32 g/l; NH₄Cl 5.0 g/l; BIS-TRIS 8.37 g/l; Source of carbon 20 g/l; L-tryptophan 0.12 g/l; Adenine-HCl 0.24 g/l; casamino acids 5.0 g/l. When required, the media were solidified with 2% Agar. Nucleotide bases and amino acids are added to 50 mg/l.

The culture medium of those transformed strains was subjected to activity measurement to prove the presence of a catalytically active secreted form of the protease of interest. The enzyme activity was determined by a colorimetric test which exploits the capacity of the enzyme to hydrolyse the substrate Z-L-tyrosine-L-lysine-L-arginine-p-nitroanilide (Z-Y-K-R-pNA). The incubation mixture comprises 0.1 mM Z-Y-K-R-pNA in 0.2 M Hepes, pH 7.0 and 1 mM CaCl₂, in a total volume of 1.5 ml at 37° C. The reaction is monitored at 405 nm, a wavelength at which the maximum decrease in the molar extinction coefficient (Δε), equal to 10.9 mM⁻¹, occurs. An enzyme unit is defined as being the quantity of enzyme that catalyses the transformation of 1 μmole of substrate per minute under the conditions described above. Unless expressly indicated otherwise, the data presented in Table 2 refer to the activity secreted in the growth medium by derivatives of the W303 strain which was grown in a flask at 30° C., with agitation (220 rpm), in 1040 medium-2% galactose (except for strains NP255 and NP265 in which the carbon source added to the growth medium was 2% glucose). Various samples were taken along the growth curve for each strain. Table 2 gives the maximum activity measured for each transformed strain. As can be seen, good levels of secretion are observed both when the expression of Kex1 is guided by the GAL10-CYC1 promoter and when using the promoter of the dehydrogenase alcohol. The invention is therefore not limited by the type of promoter used, as long as the levels of secretion of the endoprotease of the invention are satisfactory.

TABLE 2 Maximum Kex1 activity secreted in the growth medium in a flask by strains of S. cerevisiae transformed with plasmids expressing various forms of the Kex1 protease of K. lactis Name of the Expression transformed strain plasmid Activity (mU/ml) NP31 PL24 1300 NP113 PL57 Not detectable NP115 PL58 Not detectable NP180 PL98 960 NP183 PL99 1260 NP166 PL86 1250 NP168 PL87 1200 NP231 PL120 1230 NP157 PL68 700 NP33 PL22 <120 NP250 PL132 1300 NP251 PL133 1250 PL252 PL134 1280 NP255 PL135 1290 NP265 PL144 1270

It was found that the replacement of the pre-pro sequence of Kex1 by that of Kex2 resulted in a protein which was secreted in the growth medium just as well as the Kex1 protein which used its own pre-pro sequence. Conversely, the replacement of the pre sequence alone (signal sequence) by the signal sequence of the glucoamylase encoded by the STA2 gene of Saccharomyces cerevisiae var. diastaticus (strain NP33) or the signal sequence of Kex2 (strain NP157) resulted in reduced secretion of the Kex1 protein. In the experiments described in the following Examples, use was therefore made of the protein whose secretion was guided by the pre-pro sequence of Kex1 or Kex2 (strains NP31 and NP231, respectively).

It was surprisingly found that the strain NP115, expressing a Kex1 which terminates at amino acid 580, does not exhibit detectable enzyme activity. This result contrasts with the maintenance of enzyme activity on the part of the Kex2 protein which terminates at amino acid 593, that is to say, the amino acid of Kex2 corresponding to amino acid 580 of Kex1 of K. lactis (Gluschankof et al., supra, see also FIG. 2 for the alignments). This result means that, despite the high level of homology, the proteins Kex1 of K. lactis and Kex2 of S. cerevisiae differ considerably from one another so that they cannot be regarded as equivalents in all aspects relating to the embodiments of this invention. In this connection, account should also be taken of the comparative thermal stability data and activity profiles as a function of the temperature, which are given in FIGS. 3 and 4, respectively. Compared with the protein present in strain NP31, strains NP180, NP183, NP166 and NP168 have carboxy-terminal extensions which are able to cover the entire domain rich in serine/threonine, stopping immediately before the transmembrane domain. No significant differences compared with strain NP31 are observed in the Kex1 activity secreted in the growth medium by the above-mentioned strains.

Example 3 Purification of the Kex1 Protein of Kluyveromyces Lactis Secreted in the Growth Medium by Suitably Transformed Saccharomyces Cerevisiae Strains.

The purification of the deleted variant of the Kex1 protease of K. lactis ss-Kex1-C600 was carried out starting from transformed strains of S. cerevisiae grown in a flask or in a fermenter up to the late exponential phase. In both cases, the medium was separated from the cells by centrifugation, filtered over a filter having 5 μm pores and then concentrated approximately 10 times and dialysed against a solution of Triton X-100 0.1%-CaCl₂ 4 mM by ultrafiltration on a membrane with cut-off at 10 kDa. The dialysed solution was purified on a column of SP-Sepharose resin pre-equilibrated with a sodium acetate buffer pH 5.2 and eluted by means of a linear gradient of pH in tris-acetate buffer. The fractions containing the enzyme activity of kex1 were introduced onto a column of Q-Sepharose resin pre-equilibrated with bis-tris-acetate buffer and were eluted by means of a linear gradient of NaCl. The pool of fractions containing the enzyme activity of kex1 was analysed by RP-HPLC. The RP-HPLC analysis was carried out using a Vydac-C18 column, 2.1×250 mm, at a temperature of 55° C. and with UV detection at a wavelength of 215 nm; the elution was carried out with a flow of 0.21 ml/minute starting from the mobile phases A (0.1% trifluoroacetic acid in water) and B (0.08% trifluoroacetic acid in acetonitrile) with a linear gradient from 37% to 87% of mobile phase B in 21 minutes. The analysis demonstrated the presence of a homogeneous peak with a purity of more than 95% (FIG. 5) and a specific activity of approximately 100 U/mg.

Example 4 Characterisation of the Kex1 Protein of Kluyveromyces Lactis

The kinetic constants were determined by a simplified procedure which requires that measurements be carried out at a single initial concentration of substrate and until the substrate is exhausted. The method provides that, at various times, the average values of residual concentration and rate be determined and is applicable in those cases in which the enzyme catalyses an entirely irreversible reaction under the operating conditions, and is not subject to inhibition by product (Segel, I. H. (1975) Enzyme Kinetics, Wiley, N.Y.). The results so obtained were processed using the Grafit programme, which permitted the interpolation of the Michaelis-Menten profile and the determination of the values of K_(m) and V_(max), presented in Table 3. k_(cat) and the k_(cat)/K_(m) ratio were then obtained from those values and from the enzyme concentration. The results show that the two proteases ss-Kex1 and ss-Kex2 do not differ significantly in their kinetic properties.

TABLE 3 Kinetic constants of the endoproteases ss-Kex2-C613 and ss-Kex1-C600 Vmax K_(M) kcat kcat/K_(M) (ΔA min⁻¹) × 10³ μM min⁻¹ μM⁻¹ × min⁻¹ ss-Kex1-C600 1720 61.4 1.84 × 10⁵  3.0 × 10³ ss-Kex2-C613 1029 56.8 1.37 × 10⁵ 12.4 × 10³

Of the main characteristics of an enzyme which are required for industrial application as a biocatalyst, stability under operating conditions is definitely one of the most important. It will therefore be appreciated that it is necessary to investigate and characterise the two proteases from that point of view also. We initially determined the profile of thermal inactivation of the two enzymes at various temperatures in the absence of any stabilising agent. The profile was obtained by incubating the enzyme at the predetermined temperature, taking samples at successive times and determining the residual enzyme activity in the samples. The decline in the activity as a function of time, which activity is assumed to be proportional to the concentration of active enzyme, is thus determined.

The inactivation profiles of the two proteases were effected at temperatures of from 10 to 70° C. The data are given in FIG. 3 and show that, surprisingly, the ss-Kex1 protease has a thermal stability significantly higher than that of the ss-Kex2 protease: by way of example, the latter was almost completely inactivated after 6 minutes' incubation at 50° C. while, under the same conditions, Kex1 maintained not less than 50% of its activity. The inactivation profiles of the Kex1 protease at 60° C. and 70° C. and, for comparison, that of Kex2 at 50° C., also show the substantial difference in thermostability between the two molecules: it is clear that the ss-Kex1 protease is approximately as stable at 70° C. as the ss-Kex2 protein at 50° C.

When ss-Kex1 and ss-Kex2 were dosed under standard conditions, with saturating concentrations of substrate and by measuring the average rate in the first 5 minutes of the reaction, the Kex1 protein exhibited an optimum functioning temperature of approximately 70° C. while, under the same conditions, the apparent optimum working temperature of the Kex2 endopeptidase was approximately 50° C. (FIG. 4).

Example 5 Use of the Kex1 Protein of Kluyveromyces Lactis in the Processing of Fusion Proteins.

The processing of fusion proteins was studied using, as a model protein, a fusion between a peptide sequence of 35 amino acids carrying the Lys-Arg dipeptide at the carboxy-terminal end and the sequence of 191 amino acid residues of the human growth hormone (h-GH). The fusion protein, having a structure (peptide portion)-(Lys-Arg)-(h-GH), had been expressed in a transformed strain of E. coli, extracted and purified to a degree of purity higher than 70%. The hydrolysis of the fusion protein (1 gram/litre) was carried out for a total period of 18 hours, in buffer at pH 7 containing 4 mM Ca²⁺, at a temperature of 30° C. and using 0.2 mg/litre of a purified ss-Kex1-C600 preparation. The progress of the reaction was monitored by RP-HPLC analysis which showed a reaction yield higher than 95% (FIG. 6); the specificity of the hydrolysis was checked by analysis of the NH₂-terminal sequence of the reaction product which gave the expected sequence for h-GH. Similar results were obtained when h-GH was fused to polypeptides of different length (from 20 to 300 amino acid residues).

Example 6 Immobilisation of the Kex1 Protease and its Use in the Processing of Fusion Proteins

The ss-Kex1 protease was immobilised using, as the inorganic solid support, the Eupergit C250L resin, the epoxy groups of which act as reactive groups for the formation of covalent bonds with the amino, thiol or hydroxyl groups of the enzyme. That reduces the structural flexibility of the enzyme and should therefore promote the stabilisation thereof. The incubation mixture comprises 780 μg of enzyme in 1 M potassium phosphate buffer, pH 7.0 in the presence of 8 mg of Eupergit C250L resin, in a total volume of 55 μl at ambient temperature for 4 days, without agitation. At the end of the incubation period, the immobilised enzyme is subjected to successive washing operations with the dosage buffer Hepes 0.2 M pH 7.0+CaCl₂ 1 mM, and is preserved at 4° C. in the presence of a solution of 0.2 M Hepes, pH 7.0, 1 mM CaCl₂/60% glycerol.

The immobilised Kex1 protein was used in the processing of a fusion protein PNP_(20aa)-hGH, incubating the latter at 37° C. at a ratio of 10:1 with the immobilised enzyme. Under those conditions, the processing of the protein was continued for a period of from 10 minutes to 15 hours by SDS-PAGE. After 6 hours' incubation, digestion equal to approximately 60% of the fused protein is obtained and almost complete digestion is obtained after 15 hours (FIG. 7). Subsequent re-use of the same preparation of immobilised Kex1 protease under identical operating conditions led to a result substantially identical to the previous result, thus demonstrating that, under the conditions adopted, the immobilised Kex1 protease does not lose activity appreciably even after 30 hours' incubation at 37° C. and can be re-used in the freeing of the protein of interest from the fusion partner. 

1-18. (canceled)
 19. A DNA fragment coding for a recombinant soluble Kex1 endoprotease consisting of the amino acid sequence set forth as amino acid residues 103 through 600 of SEQ ID NO:
 3. 20. A DNA fragment coding for a recombinant soluble Kex1 endoprotease consisting of the amino acid sequence set forth as amino acid residues 103 through 611 of SEQ ID NO:
 3. 21. The DNA fragment of claim 19, wherein the DNA fragment comprises nucleic acids 307 to 1800 of SEQ ID NO:1.
 22. The DNA fragment of claim 20, wherein the DNA fragment comprises nucleic acids 307 to 1833 of SEQ ID NO:1.
 23. A plasmid expression vector comprising the DNA fragment according to claim
 19. 24. A plasmid expression vector comprising the DNA fragment according to claim
 20. 25. A plasmid expression vector comprising the DNA fragment according to claim
 21. 26. A plasmid expression vector comprising the DNA fragment according to claim
 22. 27. Host cells transformed by the expression vector according to claim
 23. 28. Host cells transformed by the expression vector according to claim
 24. 29. Host cells transformed by the expression vector according to claim
 25. 30. Host cells transformed by the expression vector according to claim
 26. 31. The host cells according to claim 27, wherein the cells are yeast cells.
 32. The host cells according to claim 28, wherein the cells are yeast cells.
 33. The host cells according to claim 29, wherein the cells are yeast cells.
 34. The host cells according to claim 30, wherein the cells are yeast cells.
 35. The host cells according to claim 31, wherein the cells are Saccharomyces cerevisiae yeast cells.
 36. The host cells according to claim 32, wherein the cells are Saccharomyces cerevisiae yeast cells.
 37. The host cells according to claim 33, wherein the cells are Saccharomyces cerevisiae yeast cells.
 38. The host cells according to claim 34, wherein the cells are Saccharomyces cerevisiae yeast cells. 